Chapter 1 Multiple F 0 Estimation

نویسندگان

  • DeLiang Wang
  • Guy J. Brown
  • John Wiley
چکیده

This chapter is about the estimation of multiple fundamental frequencies (F0) from a waveform such as the compound sound of several people speaking at the same time, or several musical instruments playing together. That information may be needed to transcribe the music to a score, to extract intonation patterns for speech recognition, or as an ingredient for computational auditory scene analysis. The task of estimating the single F0 of an isolated voice has motivated a surprising amount of effort over the years [45]. Work on the harder task of estimating multiple F0s is now gaining momentum, fueled by progress in signal processing techniques on the one hand, and new applications such as interactive processing or indexing of music, multimedia and speech on the other. A multiple F0 estimation method is typically assembled from two elements: a singlevoice F0 estimator, and a voice-segregation scheme. Here “voice” is used in a wide sense to designate the periodic signal produced by a source (human voice, instrument sound, etc.). Some space is accordingly devoted the topic single voice F0 estimation, but the reader should refer to the excellent treatise of Hess [45] for more details. Segregation techniques too are evoked, but the reader should follow pointers to other chapters of this book wherever possible. A sound with a periodic waveform evokes a pitch that varies with F0, the inverse of the period [87]. The pitch may be salient and musical as long as the F0 is within about 30

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Chapter 4 Nonparametric regression : minimax upper and lower bounds

We consider one of the two the most classical non-parametric problems in this example: estimating a regression function on a subset of the real line (the most classical problem being estimation of a density). In non-parametric regression, we assume there is an unknown function f : R → R, where f belongs to a pre-determined class of functions F ; usually this class is parameterized by some type ...

متن کامل

Investigation of linear and non-linear estimation methods in highly-skewed gold distribution

The purpose of this work is to compare the linear and non-linear kriging methods in the mineral resource estimation of the Qolqoleh gold deposit in Saqqez, NW Iran. Considering the fact that the gold distribution is positively skewed and has a significant difference with a normal curve, a geostatistical estimation is complicated in these cases. Linear kriging, as a resource estimation method, c...

متن کامل

Parameter estimation and linear time-series analysis

This computer exercise gives an introduction to time-series analysis and estimation and model choice using Maximum Likelihood. It is preferred if you use Matlab, but you are allowed to use the programming language or package of your choice. If you choose not to use Matlab, please note that you are required to document your code extra carefully. 1 Preparations for the exercise Read chapter 4 in ...

متن کامل

[11] Estimation of Local Receptor Density, B 0 Max , and Other Parameters via Multiple-injection Positron Emission Tomography Experiments

tem receptor assay in vitro. This Chapter attempted to convey that such in vivo studies have exactly the same goal as their in vitro counterparts, to distinguish receptor density from receptor–ligand affinity as determinants of the binding process. While considerable care and caution must be exercised in the performance and interpretation of such studies, our work to date strongly supports the ...

متن کامل

Multiple F 0 Estimation

This chapter is about the estimation of multiple fundamental frequencies (F0) from a waveform such as the compound sound of several people speaking at the same time, or several musical instruments playing together. That information may be needed to transcribe the music to a score, to extract intonation patterns for speech recognition, or as an ingredient for computational auditory scene analysi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005